Machine Learning in Applied Statistics

نویسندگان

  • Stan Lipovetsky
  • Jong-Min Kim
چکیده

This special issue of Model Assisted Statistics and Applications (MASA) focused on knowing how current machine learning methods can be applied to diverse statistics areas. We have ten papers about the recent machine learning developments and applications, including survey sampling, biostatistics, bioinformatics, genetics, time series analysis, and technology forecasting. The issue starts with a research work “Variance Estimation by Multivariate Imputation Methods in Complex Survey Designs” by Profs. Jong-Min Kim, Kee-Jae Lee, and Wonkuk Kim describing application of modern machine learning methods to survey sampling imputation. Drs. Insuk Sohn, Sujong Kim, Profs. Jae Won Lee, Ja-Yong Koo, and Dr. Junsu Ko present a paper “Identifying Novel NF-kB-Regulated Immune Genes in the Human Genome using a Discrete Kernel Structured Support Vector Machine”. This paper develops a Discrete Kernel Structured Support Vector Machine (DSSVM) and applied this method to the promoters of 58 known NF-kB-target genes to find characteristic patterns of transcription factor binding sites (TFBSs) in their promoters. Profs. Jooyong Shim and Changha Hwang present a paper “Kernel-Based Orthogonal Quantile Regression Model”. This paper proposes a kernel-based quantile regression model that effectively considers errors on both input and output variables. Profs. Hye-Seung Lee and Jeffrey P. Krischer present a paper “A New Framework for Prediction and Variable Selection for Uncommon Events in a Large Prospective Cohort Study”. This paper describes a framework illustrated with an application of featuring high-dimensional variable selection in a large prospective cohort study. Profs. Jong-Min Kim, Jea-Bok Ryu, Seung-Joo Lee, and Sunghae Jun present a paper “Penalized Regression Models for Patent Keyword Analysis”. The authors analyze the patent keywords extracted from the patent documents using ridge regression, least absolute shrinkage and selection operator, elastic net, and random forest. In addition, to show how the research could be applied to real problem efficiently, the authors carry out a case study of Apple technology. Miss Soyeon Park and Prof. Wonkuk Kim present a paper “Multifactor Dimensionality Reduction Method Based on Area under Receiver Operating Characteristic Curve”. The authors explain multifactor dimensionality reduction (MDR) method which is a machine learning algorithm to detect nonlinear interactions, and compare performance of the standard with the modified multifactor dimensionality reduction method in which the best model is selected by the area under receiver operating characteristic curve (ROC) and cross-validation consistency of the area under ROC curve. Drs. Dipankar Mitra and Ranjit Kumar Paul present a paper “Hybrid Time-Series Models for Forecasting Agricultural Commodity Prices”. The authors apply the hybrid methodology namely ARIMA-GARCH and ARIMA-ANN for modelling and forecasting of wholesale potato price in Agra market of India. The comparison of forecast performance among the ARIMA, GARCH, ARIMA-GARCH and ARIMA-ANN hybrid models shows that the hybrid models perform better with respect to minimum of MAPE and RMSE values.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine learning algorithms in air quality modeling

Modern studies in the field of environment science and engineering show that deterministic models struggle to capture the relationship between the concentration of atmospheric pollutants and their emission sources. The recent advances in statistical modeling based on machine learning approaches have emerged as solution to tackle these issues. It is a fact that, input variable type largely affec...

متن کامل

Comparative Analysis of Machine Learning Algorithms with Optimization Purposes

The field of optimization and machine learning are increasingly interplayed and optimization in different problems leads to the use of machine learning approaches‎. ‎Machine learning algorithms work in reasonable computational time for specific classes of problems and have important role in extracting knowledge from large amount of data‎. ‎In this paper‎, ‎a methodology has been employed to opt...

متن کامل

Transparent Machine Learning Algorithm Offers Useful Prediction Method for Natural Gas Density

Machine-learning algorithms aid predictions for complex systems with multiple influencing variables. However, many neural-network related algorithms behave as black boxes in terms of revealing how the prediction of each data record is performed. This drawback limits their ability to provide detailed insights concerning the workings of the underlying system, or to relate predictions to specific ...

متن کامل

Forecasting the Tehran Stock market by Machine ‎Learning Methods using a New Loss Function

Stock market forecasting has attracted so many researchers and investors that ‎many studies have been done in this field. These studies have led to the ‎development of many predictive methods, the most widely used of which are ‎machine learning-based methods. In machine learning-based methods, loss ‎function has a key role in determining the model weights. In this study a new loss ‎function is ...

متن کامل

Using Machine Learning Algorithms for Automatic Cyber Bullying Detection in Arabic Social Media

Social media allows people interact to express their thoughts or feelings about different subjects. However, some of users may write offensive twits to other via social media which known as cyber bullying. Successful prevention depends on automatically detecting malicious messages. Automatic detection of bullying in the text of social media by analyzing the text "twits" via one of the machine l...

متن کامل

Using Machine Learning ARIMA to Predict the Price of Cryptocurrencies

The increasing volatility in pricing and growing potential for profit in digital currency have made predicting the price of cryptocurrency a very attractive research topic. Several studies have already been conducted using various machine-learning models to predict crypto currency prices. This study presented in this paper applied a classic Autoregressive Integrated Moving Average(ARIMA) model ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • MASA

دوره 12  شماره 

صفحات  -

تاریخ انتشار 2017